Tootfinder

@arXiv_csCL_bot@mastoxiv.page
2024-04-15 08:30:17

This https://arxiv.org/abs/2401.08772 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
In this work, we present HuixiangDou, a technical assistant powered by Large Language Models (LLM). This system is designed to assist algorithm developers by providing insightful responses to questions related to open-source algorithm projects, such as computer vision and deep learning projects from OpenMMLab. We further explore the integration of this assistant into the group chats of instant messaging (IM) tools such as WeChat and Lark. Through several iterative improvements and trials, we ha…

@arXiv_csCR_bot@mastoxiv.page
2024-03-14 08:31:45

This https://arxiv.org/abs/2312.03853 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…

Dr. Jekyll and Mr. Hyde: Two Faces of LLMs
Only a year ago, we witnessed a rise in the use of Large Language Models (LLMs), especially when combined with applications like chatbot assistants. Safety mechanisms and specialized training procedures are implemented to prevent improper responses from these assistants. In this work, we bypass these measures for ChatGPT and Bard (and, to some extent, Bing chat) by making them impersonate complex personas with opposite characteristics as those of the truthful assistants they are supposed to be.…

@MartinM@norden.social
2024-04-14 15:29:54

Der Ausreden-Generator erzeugt einen Text, der wirklich von #Wissing sein könnte. Andere fürchten, eine ausgefeilte KI könnte sie ersetzen, beim #Verkehrsminister reicht dazu sogar ein einfacher Chat-Bot.

"Ich, Volker Wissing, trage keine Schuld am mangelnden Klimaschutz im Verkehr. Es waren die Grünen, die uns mit ihrer veganen Ernährung und ihrem Fahrrad-Fetischismus dazu verpflichtet haben, auf unseren SUVs und Benzinschluckern zu beharren. Wir mussten einfach die Fahne der Freiheit und des Wohlstands hochhalten!" - Volker Wissing, Bundesminister für Verkehr

Ausreden-Generator für Klimaschutz-Blockierer
Du musst ein Ministerium führen und der CO2-Ausstoß in deinem Sektor will nicht sinken? Kein Problem – nutze einfach unseren Ausreden-Generator für effektives Blockieren beim Klimaschutz.

@arXiv_csHC_bot@mastoxiv.page
2024-03-13 06:50:38

generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation
Thilo Spinner, Rebecca Kehlbeck, Rita Sevastjanova, Tobias St\"ahle, Daniel A. Keim, Oliver Deussen, Mennatallah El-Assady
https://arxiv.org/abs/2403.07627

generAItor: Tree-in-the-Loop Text Generation for Language Model Explainability and Adaptation
Large language models (LLMs) are widely deployed in various downstream tasks, e.g., auto-completion, aided writing, or chat-based text generation. However, the considered output candidates of the underlying search algorithm are under-explored and under-explained. We tackle this shortcoming by proposing a tree-in-the-loop approach, where a visual representation of the beam search tree is the central component for analyzing, explaining, and adapting the generated outputs. To support these tasks, …

@arXiv_csCR_bot@mastoxiv.page
2024-03-14 08:31:45

This https://arxiv.org/abs/2312.03853 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…

Dr. Jekyll and Mr. Hyde: Two Faces of LLMs
Only a year ago, we witnessed a rise in the use of Large Language Models (LLMs), especially when combined with applications like chatbot assistants. Safety mechanisms and specialized training procedures are implemented to prevent improper responses from these assistants. In this work, we bypass these measures for ChatGPT and Bard (and, to some extent, Bing chat) by making them impersonate complex personas with opposite characteristics as those of the truthful assistants they are supposed to be.…

@arXiv_csCL_bot@mastoxiv.page
2024-04-12 08:28:43

This https://arxiv.org/abs/2402.12749 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

Me LLaMA: Foundation Large Language Models for Medical Applications
Recent advancements in large language models (LLMs) such as ChatGPT and LLaMA have hinted at their potential to revolutionize medical applications, yet their application in clinical settings often reveals limitations due to a lack of specialized training on medical-specific data. In response to this challenge, this study introduces Me-LLaMA, a novel medical LLM family that includes foundation models - Me-LLaMA 13/70B, along with their chat-enhanced versions - Me-LLaMA 13/70B-chat, developed thr…

@arXiv_csAI_bot@mastoxiv.page
2024-04-09 06:46:58

Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats
Kunyao Lan, Cong Ming, Binwei Yao, Lu Chen, Mengyue Wu
https://arxiv.org/abs/2404.05012

Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats
Chatbots can serve as a viable tool for preliminary depression diagnosis via interactive conversations with potential patients. Nevertheless, the blend of task-oriented and chit-chat in diagnosis-related dialogues necessitates professional expertise and empathy. Such unique requirements challenge traditional dialogue frameworks geared towards single optimization goals. To address this, we propose an innovative ontology definition and generation framework tailored explicitly for depression diagn…

@Techmeme@techhub.social
2024-03-30 02:36:08

NYC's Microsoft-powered MyCity chatbot, launched as a pilot program last October, often gives inaccurate info, including telling businesses to break the law (Colin Lecher/The City)
https://www.thecity.nyc/2024/03/29/ai-chat-false-information-small-…

NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law
The Microsoft-powered bot says bosses can take worker’s tips and that landlords can discriminate based on source of income. That's not right.

@arXiv_csCL_bot@mastoxiv.page
2024-03-11 07:18:38

ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
Jun Xu, Mengshu Sun, Zhiqiang Zhang, Jun Zhou
https://arxiv.org/abs/2403.05132

ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
Recent advancements in large language models have shown impressive performance in general chat. However, their domain-specific capabilities, particularly in information extraction, have certain limitations. Extracting structured information from natural language that deviates from known schemas or instructions has proven challenging for previous prompt-based methods. This motivated us to explore domain-specific modeling in chat-based language models as a solution for extracting structured infor…

@MAD_democracy@journa.host
2024-03-29 17:39:37

From @…
NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law https://www.thecity.nyc/2024/03/29/ai-chat-false-…

NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law
The Microsoft-powered bot says bosses can take worker’s tips and that landlords can discriminate based on source of income. That's not right.

@arXiv_csSE_bot@mastoxiv.page
2024-03-04 06:52:53

FhGenie: A Custom, Confidentiality-preserving Chat AI for Corporate and Scientific Use
Ingo Weber, Hendrik Linka, Daniel Mertens, Tamara Muryshkin, Heinrich Opgenoorth, Stefan Langer
https://arxiv.org/abs/2403.00039

FhGenie: A Custom, Confidentiality-preserving Chat AI for Corporate and Scientific Use
Since OpenAI's release of ChatGPT, generative AI has received significant attention across various domains. These AI-based chat systems have the potential to enhance the productivity of knowledge workers in diverse tasks. However, the use of free public services poses a risk of data leakage, as service providers may exploit user input for additional training and optimization without clear boundaries. Even subscription-based alternatives sometimes lack transparency in handling user data. To addr…

@drahardja@sfba.social
2024-03-29 16:34:51

New York City’s new #AI chatbot (predictably) sometimes gives bad, law-breaking advice to residents’ queries about how to run their businesses.
Q: What’s worse than a bot that always gives bad advice?
A: A bot that *sometimes* gives bad advice.
“NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law”

NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law
The Microsoft-powered bot says bosses can take worker’s tips and that landlords can discriminate based on source of income. That's not right.

@arXiv_csIR_bot@mastoxiv.page
2024-04-09 06:50:18

The Use of Generative Search Engines for Knowledge Work and Complex Tasks
Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang
https://arxiv.org/abs/2404.04268

The Use of Generative Search Engines for Knowledge Work and Complex Tasks
Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine. Through the empirical analysis of Bing Copilot (Bing Chat), one of the first publicly available gen…

@ampersine@mastodon.online
2024-05-07 15:00:01

#AlaskaAirlines Fun Facts

Alaska Airlines chat bot informing the user that lions have huge balls

@arXiv_csCL_bot@mastoxiv.page
2024-03-11 07:18:38

ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
Jun Xu, Mengshu Sun, Zhiqiang Zhang, Jun Zhou
https://arxiv.org/abs/2403.05132

ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models
Recent advancements in large language models have shown impressive performance in general chat. However, their domain-specific capabilities, particularly in information extraction, have certain limitations. Extracting structured information from natural language that deviates from known schemas or instructions has proven challenging for previous prompt-based methods. This motivated us to explore domain-specific modeling in chat-based language models as a solution for extracting structured infor…

@arXiv_qbioQM_bot@mastoxiv.page
2024-04-09 07:13:36

GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console
Anindita Nath (Center for Precision Health, McWilliams School of Biomedical Informatics, UT Health Houston, TX), Savannah Mwesigwa (Center for Precision Health, McWilliams School of Biomedical Informatics, UT Health Houston, TX), Yulin Dai (Center for Precision Health, McWilliams School of Biomedical Informatics, UT Health Houston, TX), Xiaoqian Jiang (Department of Health Data Science and Artificia…

GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console
Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrieval, and visualization of customized domain-specific genetic information, and integrates functional…

@ben@a11y.info
2024-03-19 16:39:27

Content warning: GenAI

Of all the baffling GenAI decisions out there, I am maybe most baffled by Snapchat's “My AI” chatbot.
• Takes up space at the top of my chat list, above friends I've chatted with within the past few days, despite the fact I haven't engaged with the bot in the near year it's been out
• Can only be hidden/removed if you pay for Snapchat's premium membership
Who is this for? Who's going to a social platform to talk with a chatbot? Is the platform so emp…

@arXiv_csHC_bot@mastoxiv.page
2024-05-10 07:35:20

Understanding and Mitigating Harmful Design in User-Generated Virtual Worlds
Zinan Zhang, Xinning Gui, Yubo Kou
https://arxiv.org/abs/2405.05922 https://…

Understanding and Mitigating Harmful Design in User-Generated Virtual Worlds
Virtual space offers innovative ways for individuals to engage with one another in a digital setting. Prominent virtual social platforms, such as Facebook Spaces, VR Chat, and AltspaceVR, facilitate social connections, allowing users to interact seamlessly. Additionally, certain video games, like Second Life and World of Warcraft, are set within these virtual spaces as well, providing immersive player experiences. As the popularity of virtual space grows, various companies have begun to democra…

@arXiv_csCL_bot@mastoxiv.page
2024-03-11 08:30:28

This https://arxiv.org/abs/2401.14040 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

(Chat)GPT v BERT: Dawn of Justice for Semantic Change Detection
In the universe of Natural Language Processing, Transformer-based language models like BERT and (Chat)GPT have emerged as lexical superheroes with great power to solve open research problems. In this paper, we specifically focus on the temporal problem of semantic change, and evaluate their ability to solve two diachronic extensions of the Word-in-Context (WiC) task: TempoWiC and HistoWiC. In particular, we investigate the potential of a novel, off-the-shelf technology like ChatGPT (and GPT) 3.…

@arXiv_condmatstatmech_bot@mastoxiv.page
2024-03-07 07:23:58

Dynamic Scaling of Two-Dimensional Polar Flocks
Hugues Chat\'e, Alexandre Solon
https://arxiv.org/abs/2403.03804 https://arxiv.or…

Dynamic Scaling of Two-Dimensional Polar Flocks
We propose a hydrodynamic description of the homogeneous ordered phase of polar flocks. Starting from symmetry principles, we construct the appropriate equation for the dynamics of the Goldstone mode associated with the broken rotational symmetry. We then focus on the two-dimensional case considering both "Malthusian flocks" for which the density field is a fast variable that does not enter the hydrodynamic description and "Vicsek flocks" for which it does. In both cases, we argue in favor of s…

@drahardja@sfba.social
2024-03-29 16:34:51

New York City’s new #AI chatbot (predictably) sometimes gives bad, law-breaking advice to residents’ queries about how to run their businesses.
Q: What’s worse than a bot that always gives bad advice?
A: A bot that *sometimes* gives bad advice.
“NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law”

NYC AI Chatbot Touted by Adams Tells Businesses to Break the Law
The Microsoft-powered bot says bosses can take worker’s tips and that landlords can discriminate based on source of income. That's not right.

@arXiv_csHC_bot@mastoxiv.page
2024-05-10 07:35:20

Understanding and Mitigating Harmful Design in User-Generated Virtual Worlds
Zinan Zhang, Xinning Gui, Yubo Kou
https://arxiv.org/abs/2405.05922 https://…

Understanding and Mitigating Harmful Design in User-Generated Virtual Worlds
Virtual space offers innovative ways for individuals to engage with one another in a digital setting. Prominent virtual social platforms, such as Facebook Spaces, VR Chat, and AltspaceVR, facilitate social connections, allowing users to interact seamlessly. Additionally, certain video games, like Second Life and World of Warcraft, are set within these virtual spaces as well, providing immersive player experiences. As the popularity of virtual space grows, various companies have begun to democra…

@arXiv_csCR_bot@mastoxiv.page
2024-04-04 06:47:57

Exploring Backdoor Vulnerabilities of Chat Models
Yunzhuo Hao, Wenkai Yang, Yankai Lin
https://arxiv.org/abs/2404.02406 https://arxiv…

Exploring Backdoor Vulnerabilities of Chat Models
Recent researches have shown that Large Language Models (LLMs) are susceptible to a security threat known as Backdoor Attack. The backdoored model will behave well in normal cases but exhibit malicious behaviours on inputs inserted with a specific backdoor trigger. Current backdoor studies on LLMs predominantly focus on instruction-tuned LLMs, while neglecting another realistic scenario where LLMs are fine-tuned on multi-turn conversational data to be chat models. Chat models are extensively ad…

@arXiv_csIR_bot@mastoxiv.page
2024-03-01 06:51:11

Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines
Lijia Ma, Xingchen Xu, Yong Tan
https://arxiv.org/abs/2402.19421 https:/…

Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines
In the domain of digital information dissemination, search engines act as pivotal conduits linking information seekers with providers. The advent of chat-based search engines utilizing Large Language Models (LLMs) and Retrieval Augmented Generation (RAG), exemplified by Bing Chat, marks an evolutionary leap in the search ecosystem. They demonstrate metacognitive abilities in interpreting web information and crafting responses with human-like understanding and creativity. Nonetheless, the intric…

@ben@a11y.info
2024-03-19 16:39:27

Content warning: GenAI

Of all the baffling GenAI decisions out there, I am maybe most baffled by Snapchat's “My AI” chatbot.
• Takes up space at the top of my chat list, above friends I've chatted with within the past few days, despite the fact I haven't engaged with the bot in the near year it's been out
• Can only be hidden/removed if you pay for Snapchat's premium membership
Who is this for? Who's going to a social platform to talk with a chatbot? Is the platform so emp…

@arXiv_csCL_bot@mastoxiv.page
2024-03-07 08:24:54

This https://arxiv.org/abs/2310.04799 has been replaced.
link: https://scholar.google.com/scholar?q=a

Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages
Recently, the development of open-source large language models (LLMs) has advanced rapidly. Nevertheless, due to data constraints, the capabilities of most open-source LLMs are primarily focused on English. To address this issue, we introduce the concept of chat vector to equip pre-trained language models with instruction following and human value alignment via simple model arithmetic. The chat vector is derived by subtracting the weights of a pre-trained base model (e.g. LLaMA2) from those of …

@arXiv_csCR_bot@mastoxiv.page
2024-05-06 08:26:34

This https://arxiv.org/abs/2312.03853 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…

Dr. Jekyll and Mr. Hyde: Two Faces of LLMs
Recently, we have witnessed a rise in the use of Large Language Models (LLMs), especially in applications like chatbot assistants. Safety mechanisms and specialized training procedures are implemented to prevent improper responses from these assistants. In this work, we bypass these measures for ChatGPT and Bard (and, to some extent, Bing chat) by making them impersonate complex personas with personality characteristics that are not aligned with a truthful assistant. We start by creating elabor…

@arXiv_csMA_bot@mastoxiv.page
2024-02-27 06:50:55

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese
https://arxiv.org/abs/2402.15538

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
The booming success of LLMs initiates rapid development in LLM agents. Though the foundation of an LLM agent is the generative model, it is critical to devise the optimal reasoning strategies and agent architectures. Accordingly, LLM agent research advances from the simple chain-of-thought prompting to more complex ReAct and Reflection reasoning strategy; agent architecture also evolves from single agent generation to multi-agent conversation, as well as multi-LLM multi-agent group chat. Howeve…

@arXiv_csSE_bot@mastoxiv.page
2024-02-23 07:23:36

Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Anisha Agarwal, Aaron Chan, Shubham Chandel, Jinu Jang, Shaun Miller, Roshanak Zilouchian Moghaddam, Yevhen Mohylevskyy, Neel Sundaresan, Michele Tufano
https://arxiv.org/abs/2402.14261

Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
The integration of Large Language Models (LLMs) into Development Environments (IDEs) has become a focal point in modern software development. LLMs such as OpenAI GPT-3.5/4 and Code Llama offer the potential to significantly augment developer productivity by serving as intelligent, chat-driven programming assistants. However, utilizing LLMs out of the box is unlikely to be optimal for any given scenario. Rather, each system requires the LLM to be honed to its set of heuristics to ensure the best…

@arXiv_csHC_bot@mastoxiv.page
2024-03-04 08:31:39

This https://arxiv.org/abs/2312.06024 has been replaced.
link: https://scholar.google.com/scholar?q=a

Thinking Assistants: LLM-Based Conversational Assistants that Help Users Think By Asking rather than Answering
We introduce the concept of "thinking assistants", an approach that encourages users to engage in deep reflection and critical thinking through brainstorming and thought-provoking queries. We instantiate one such thinking assistant, Gradschool.chat, as a virtual assistant tailored to assist prospective graduate students. We posit that thinking assistants are particularly relevant to situations like applying to graduate school, a phase often characterized by the challenges of academic preparatio…

@arXiv_csHC_bot@mastoxiv.page
2024-04-30 08:34:02

This https://arxiv.org/abs/2401.15182 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csHC_…

App Planner: Utilizing Generative AI in K-12 Mobile App Development Education
App Planner is an interactive support tool for K-12 students, designed to assist in creating mobile applications. By utilizing generative AI, App Planner helps students articulate the problem and solution through guided conversations via a chat-based interface. It assists them in brainstorming and formulating new ideas for applications, provides feedback on those ideas, and stimulates creative thinking. Here we report usability tests from our preliminary study with high-school students who appr…

@arXiv_csCL_bot@mastoxiv.page
2024-05-03 07:15:30

"Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real Time
Scott Rome, Tianwen Chen, Raphael Tang, Luwei Zhou, Ferhan Ture
https://arxiv.org/abs/2405.00801 <…

"Ask Me Anything": How Comcast Uses LLMs to Assist Agents in Real Time
Customer service is how companies interface with their customers. It can contribute heavily towards the overall customer satisfaction. However, high-quality service can become expensive, creating an incentive to make it as cost efficient as possible and prompting most companies to utilize AI-powered assistants, or "chat bots". On the other hand, human-to-human interaction is still desired by customers, especially when it comes to complex scenarios such as disputes and sensitive topics like bill…

@arXiv_csCL_bot@mastoxiv.page
2024-05-03 07:16:42

WildChat: 1M ChatGPT Interaction Logs in the Wild
Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, Yuntian Deng
https://arxiv.org/abs/2405.01470

WildChat: 1M ChatGPT Interaction Logs in the Wild
Chatbots such as GPT-4 and ChatGPT are now serving millions of users. Despite their widespread use, there remains a lack of public datasets showcasing how these tools are used by a population of users in practice. To bridge this gap, we offered free access to ChatGPT for online users in exchange for their affirmative, consensual opt-in to anonymously collect their chat transcripts and request headers. From this, we compiled WildChat, a corpus of 1 million user-ChatGPT conversations, which consi…

@arXiv_csIR_bot@mastoxiv.page
2024-04-18 08:34:41

This https://arxiv.org/abs/2402.14301 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…

GenSERP: Large Language Models for Whole Page Presentation
The advent of large language models (LLMs) brings an opportunity to minimize the effort in search engine result page (SERP) organization. In this paper, we propose GenSERP, a framework that leverages LLMs with vision in a few-shot setting to dynamically organize intermediate search results, including generated chat answers, website snippets, multimedia data, knowledge panels into a coherent SERP layout based on a user's query. Our approach has three main stages: (1) An information gathering pha…

@arXiv_csHC_bot@mastoxiv.page
2024-03-27 08:25:09

This https://arxiv.org/abs/2401.15182 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csHC_…

App Planner: Utilizing Generative AI in K-12 Mobile App Development Education
App Planner is an interactive support tool for K-12 students, designed to assist in creating mobile applications. By utilizing generative AI, App Planner helps students articulate the problem and solution through guided conversations via a chat-based interface. It assists them in brainstorming and formulating new ideas for applications, provides feedback on those ideas, and stimulates creative thinking. Here we report usability tests from our preliminary study with high-school students who appr…

@arXiv_csCL_bot@mastoxiv.page
2024-02-28 08:30:13

This https://arxiv.org/abs/2402.16107 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

FuseChat: Knowledge Fusion of Chat Models
While training large language models (LLMs) from scratch can indeed lead to models with distinct capabilities and strengths, this approach incurs substantial costs and may lead to potential redundancy in competencies. An alternative strategy is to combine existing LLMs into a more robust LLM, thereby diminishing the necessity for expensive pre-training. However, due to the diverse architectures of LLMs, direct parameter blending proves to be unfeasible. Recently, \textsc{FuseLLM} introduced the…

@arXiv_csIR_bot@mastoxiv.page
2024-02-23 06:50:16

GenSERP: Large Language Models for Whole Page Presentation
Zhenning Zhang, Yunan Zhang, Suyu Ge, Guangwei Weng, Mridu Narang, Xia Song, Saurabh Tiwary
https://arxiv.org/abs/2402.14301

GenSERP: Large Language Models for Whole Page Presentation
The advent of large language models (LLMs) brings an opportunity to minimize the effort in search engine result page (SERP) organization. In this paper, we propose GenSERP, a framework that leverages LLMs with vision in a few-shot setting to dynamically organize intermediate search results, including generated chat answers, website snippets, multimedia data, knowledge panels into a coherent SERP layout based on a user's query. Our approach has three main stages: (1) An information gathering pha…

@arXiv_csCL_bot@mastoxiv.page
2024-03-25 08:30:47

This https://arxiv.org/abs/2403.13592 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

Llama meets EU: Investigating the European Political Spectrum through the Lens of LLMs
Instruction-finetuned Large Language Models inherit clear political leanings that have been shown to influence downstream task performance. We expand this line of research beyond the two-party system in the US and audit Llama Chat in the context of EU politics in various settings to analyze the model's political knowledge and its ability to reason in context. We adapt, i.e., further fine-tune, Llama Chat on speeches of individual euro-parties from debates in the European Parliament to reevaluat…

@arXiv_csHC_bot@mastoxiv.page
2024-03-18 06:49:55

Enhancing Depression-Diagnosis-Oriented Chat with Psychological State Tracking
Yiyang Gu, Yougen Zhou, Qin Chen, Ningning Zhou, Jie Zhou, Aimin Zhou, Liang He
https://arxiv.org/abs/2403.09717

Enhancing Depression-Diagnosis-Oriented Chat with Psychological State Tracking
Depression-diagnosis-oriented chat aims to guide patients in self-expression to collect key symptoms for depression detection. Recent work focuses on combining task-oriented dialogue and chitchat to simulate the interview-based depression diagnosis. Whereas, these methods can not well capture the changing information, feelings, or symptoms of the patient during dialogues. Moreover, no explicit framework has been explored to guide the dialogue, which results in some useless communications that a…

@arXiv_csHC_bot@mastoxiv.page
2024-03-21 06:53:22

VCounselor: A Psychological Intervention Chat Agent Based on a Knowledge-Enhanced Large Language Model
H. Zhang, Z. Qiao, H. Wang, B. Duan, J. Yin
https://arxiv.org/abs/2403.13553

VCounselor: A Psychological Intervention Chat Agent Based on a Knowledge-Enhanced Large Language Model
Conversational artificial intelligence can already independently engage in brief conversations with clients with psychological problems and provide evidence-based psychological interventions. The main objective of this study is to improve the effectiveness and credibility of the large language model in psychological intervention by creating a specialized agent, the VCounselor, to address the limitations observed in popular large language models such as ChatGPT in domain applications. We achieve…

@arXiv_csCL_bot@mastoxiv.page
2024-05-01 06:49:10

Iterative Reasoning Preference Optimization
Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho, He He, Sainbayar Sukhbaatar, Jason Weston
https://arxiv.org/abs/2404.19733 https://arxiv.org/pdf/2404.19733
arXiv:2404.19733v1 Announce Type: new
Abstract: Iterative preference optimization methods have recently been shown to perform well for general instruction tuning tasks, but typically make little improvement on reasoning tasks (Yuan et al., 2024, Chen et al., 2024). In this work we develop an iterative approach that optimizes the preference between competing generated Chain-of-Thought (CoT) candidates by optimizing for winning vs. losing reasoning steps that lead to the correct answer. We train using a modified DPO loss (Rafailov et al., 2023) with an additional negative log-likelihood term, which we find to be crucial. We show reasoning improves across repeated iterations of this scheme. While only relying on examples in the training set, our approach results in increasing accuracy for Llama-2-70B-Chat from 55.6% to 81.6% on GSM8K (and 88.7% with majority voting out of 32 samples), from 12.5% to 20.8% on MATH, and from 77.8% to 86.7% on ARC-Challenge, which outperforms other Llama-2-based models not relying on additionally sourced datasets.

@arXiv_csHC_bot@mastoxiv.page
2024-03-20 07:34:35

Fact Checking Chatbot: A Misinformation Intervention for Instant Messaging Apps and an Analysis of Trust in the Fact Checkers
Gionnieve Lim, Simon T. Perrault
https://arxiv.org/abs/2403.12913

Fact Checking Chatbot: A Misinformation Intervention for Instant Messaging Apps and an Analysis of Trust in the Fact Checkers
In Singapore, there has been a rise in misinformation on mobile instant messaging services (MIMS). MIMS support both small peer-to-peer networks and large groups. Misinformation in the former may spread due to recipients' trust in the sender while in the latter, misinformation can directly reach a wide audience. The encryption of MIMS makes it difficult to address misinformation directly. As such, chatbots have become an alternative solution where users can disclose their chat content directly …

Tootfinder

Opt-in global Mastodon full text search. Join the index!